Supporting files for Blau, Kahn and Papps, "Gender, source country characteristics, and labor market assimilation among immigrants." Review of Economics and Statistics, February 2011, 93(1): 43-58


The files are designed to be used with Stata Version 11 for Windows.
Below, YYYY takes the values 1980, 1990 and 2000.


The data files consist of:

"Census YYYY data.dta" files - contain Stata format versions of the 5% extracts from the 1980, 1990 and 2000 US Censuses, as accessed via IPUMS. All variables are defined at http://usa.ipums.org/usa-action/variables/group
"Country group codes.dta" - contains information on the grouping of countries/territories over time.
"finaldata.dta" - contains country-year level data from the World Bank and UN, as defined in the data appendix to the paper and the labels assigned in "Immigrant data.do".

All variables with names starting with "i_" are allocation flags, where: 0="Not missing", 1="Data from another country", 2="Interpolated or extrapolated", 3="Uses alternative series", 4="Missing but not allocated".
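
The flag coding above can be attached to the data as Stata value labels. The following is a minimal sketch only: it assumes that the "i_" variables are stored as numeric variables in "finaldata.dta", that the file is in the current working directory, and it uses a hypothetical label name (allocflag) that does not come from the original files.

```stata
* Sketch: attach the allocation-flag coding to every i_* variable.
* Assumption: "finaldata.dta" is in the current working directory
* and the i_* variables are numeric.
use "finaldata.dta", clear

* "allocflag" is an illustrative label name, not part of the original files.
label define allocflag 0 "Not missing" ///
                       1 "Data from another country" ///
                       2 "Interpolated or extrapolated" ///
                       3 "Uses alternative series" ///
                       4 "Missing but not allocated", replace

* Apply the label to each flag and show how often each code occurs.
foreach v of varlist i_* {
    label values `v' allocflag
    tabulate `v', missing
}
```

The labels exist only in memory here; nothing is saved back to disk, so the supplied data file is left unchanged.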


The program files consist of:

1. Data creation files
"Dataset code YYYY.do" - takes the original data files "Census YYYY data.dta" and produces "Census YYYY Husb data.dta", "Census YYYY Wife data.dta", "Census YYYY Single men data.dta", "Census YYYY Single women data.dta"
"Immigrant file code.do" - takes "finaldata.dta", "Country group codes.dta", "Census YYYY Husb data.dta", "Census YYYY Wife data.dta", "Census YYYY Single men data.dta", "Census YYYY Single women data.dta" and produces "Census YYYY Immigrant data.dta"
"Native file code.do" - takes "Census YYYY Husb data.dta", "Census YYYY Wife data.dta", "Census YYYY Single men data.dta", "Census YYYY Single women data.dta" and produces "Census YYYY Native data.dta"
"Analysis.do" - takes "Census YYYY Immigrant data.dta" and "Census YYYY Native data.dta" and produces "Census YYYY Analysis data.dta"
"Regression dataset code.do" - takes "finaldata.dta", "Country group codes.dta" and "Census YYYY Analysis data.dta" and produces "Regression dataset (married) (cts educ).dta"

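The inputs and outputs listed above imply a run order for the data creation files, since each file needs the output of earlier ones. A master do-file along the following lines is one way to run them in sequence. It is a sketch only, not part of the supplied files: it assumes that all .do and .dta files are in the current working directory and that each do-file finds its own inputs there. The relative order of "Immigrant file code.do" and "Native file code.do" is arbitrary, because neither uses the other's output.

```stata
* Sketch of a master do-file for the data creation steps.
* Assumption: all .do and .dta files are in the current working directory.
version 11
clear all
set more off

* Stop early if an original input file is missing.
confirm file "finaldata.dta"
confirm file "Country group codes.dta"

* Step 1: build the husband, wife, single men and single women files
* for each census year.
foreach yyyy in 1980 1990 2000 {
    confirm file "Census `yyyy' data.dta"
    do "Dataset code `yyyy'.do"
}

* Step 2: build the immigrant and native files.
do "Immigrant file code.do"
do "Native file code.do"

* Step 3: combine them into the analysis files.
do "Analysis.do"

* Step 4: build the regression dataset used by the table do-files.
do "Regression dataset code.do"
```

If any do-file sets its own working directory or uses hard-coded paths, those would need to be adjusted before a script like this runs end to end.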

2. Data analysis files
"Table_1.do" - takes "Regression dataset (married) (cts educ).dta" and produces the results in Table 1 of the paper
"Table_2.do" - takes "Regression dataset (married) (cts educ).dta" and produces the results in Table 2 of the paper
"Tables_3_4.do" - takes "Regression dataset (married) (cts educ).dta" and produces the results in Tables 3 and 4 of the paper
"Table_5.do" - takes "Regression dataset (married) (cts educ).dta" and produces the results in Table 5 of the paper
"Table_6.do" - takes "Regression dataset (married) (cts educ).dta" and produces the results in Table 6 of the paper